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SYSTEM THEORY AS APPLIED 


DIFFERENTIAL GEOMETRY 


By Robert Hermann 
Division of Applied Sciences 
Harvard Un iver s it y 


SUMMARY 


The opening sections sketch the mathematical framework for System Theory. 

It has long been recognized that there are (at least) three levels at which it 
is necessary to study systems: Individual systems, systems depending on para- 

meters, and "adaptive" or "self-tuning" systems. Systematic and thorough 
mathematical study at the second level is no doubt necessary for successful 
future attack on the third level; it is in this study that differential- 
geometric methodology becomes most useful, indeed, probably essential. It is 
also pointed out that some of the most basic and important problems in System 
Theory (even at the practical level) involve properties of orbits and orbit 
spaces of certain types of Lie groups acting on manifolds. 

The core of the report is the description of a new technique for computing 
these orbits for the case of combined feedback and state equivalence groups 
acting on linear systems. Although it is equivalent to the well-known Brunovsky 
canonical form, which is an algebraic method, this technique involves differen- 
tial geometry, particularly the classification of holomorphic vector bundles 
on the Riemannian sphere due to A. Grothendieck. It also generalizes to distri- 
buted parameter systems; certain examples are discussed in a tentative way here. 
Finally, a generalization to Riemannian geometry of the "pseudo- inverse" 
construction is presented as a possible useful tool in certain identification 
problems. 


1. INTRODUCTION 


Looking at history, we can recognize that many areas of mathematics have 
developed in symbiosis with science and technology. For example, let us cite 
the development of differential calculus and differential equations in the 18th 
and 19th centuries in conjunction with analytical mechanics; the development 
of the theory of linear partial differential equations in conjunction with 
continuum mechanics and electromagnetism; in the 20th century, functional 
analysis and differential geometry have been paired with quantum mechanics 


and relativity, although the relation between the mathematics and physics has 
not been as close, direct and stimulating as in earlier times. 

A new feature in the science and technology of our day is the use of the 
hybrid scientific-technological "system", analyzed, designed and run by 
computers. One has only to think of the space program, and the analogy with 
such 19th century facts as the development of electricity and the automobile, 
to gain a historical perspective. Just as the hybrid scientific-technological 
projects of the 19th century stimulated development of the mathematical topics 
cited above, in our day there has arisen an amorphous subject called System 
and Control Theory. By its nature it sprawls over disciplinary boundaries, 
and is difficult to pin down; one cannot go to the library or to most major 
universities and systematically learn about it as one would learn, say, differ- 
ential equations or quantum mechanics. Of course there are textbooks — the 
treatises by Anderson and Moore [25] , Brockett [26] , Rosenbrock [272 and 
Wonham [28] are the best for our purposes. At the research level the journals 
IEEE Transaotions i-n Automati-o Contvot, SIAM Joiimat of Control and Optimization, 
and Automatiaa are standard ones. A striking characteristic of this material 
is its strong reliance on mathematics, in contrast to other major contemporary 
areas of science and technology. This is in the nature of the s\abject — the 
basic idea is to translate broad areas of science and technology into mathe- 
matical models that can be handled on the computer in a way that is as indepen- 
dent as possible of the origin of the models. Most of the research work 
involves analysis and probability theory, and some relatively concrete aspects 
of algebra (e.g., matrix theory, combinatorics) that were developed in the 19th 
century. (In fact, Gantmacher's treatise [6] is the only complete reference 
for some of the algebraic ideas.) However, there has also been a contact with 
differential geometry which has played a role in guiding research in the 
discipline. This report will focus on certain aspects of this interface with 
geometry, with emphasis on work that I have done with Clyde Martin on the 
geometry of linear systems. I will try to concentrate on areas which I believe 
have the greatest potential utility combined with the possibility of providing 
new research insights into System Theory. My overall thesis is that modern 
differential geometry is, among all areas of contemporary mathematics, the best 
qualified to provide new thought in diverse areas of science and technology. 

A short glance at the literature suggests that probability theory and 
matrix algebra are closely linked to system and control theory, but why differ- 
ential geometry? Uow, differential geometry is most commonly thought of as the 
study of curves and surfaces in "our" Euclidean three space. Although it has 
expanded enormously in recent times, allied at times with topology, group-theory, 
and algebraic geometry, this classical theme remains the core. The point I want 
to bring to the foreground in this report is that system and control theory also 
involve curves and surfaces — in more exotic spaces, it is true, but the method- 
ology of "modern" differential geometry has been oriented precisely to the task 
of making this generalization. 

This methodology has involved the theory of what mathematicians call 
differential manifolds . Unfortunately this is a difficult subject to learn, 
since it involves fragments from all the major mathematical disciplines. At 
the point in this report that we get down to the real work, it will be necessary 
to assume that the reader has at least some acquaintance with its principles 
and notation — the treatises by Boothby [20] , Dieudonn^ [21] , and the author [22] 
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I 


may be cited- We shall also make use of material which interfaces differential 
and algebraic geometry — the standard reference is now the very recent book by 
Griffiths and Harris [23] . 

In geometry, a manvfotd is a space that looks locally like a piece of 
Euclidean space, but globally may not be one. This means that, given any point, 
one can coordinatize a neighborhood of that point with n real numbers (n is 
the dimension of the manifold) , but that these number labels vary as the point 
moves over the space. However, as in tensor analysis, one wants to use these 
number assignments to perform the usual operations of differential and integral 
calculus. This requires that the number labels one assigns to different points 
(called aoovdinate systems) be related to each other in an infinitely diffeven- 
tiabte ("C“") way. So, a space with a topological structure to assure over-all 
consistency, and a way of attaching local coordinates to points, not uniquely, 
but changing in a C” way, is called a differentiable manifold. In modern 
mathematics, it has been found to be just about the most appropriately "general" 
geometric object to study, in the sense that it includes the class of Euclidean 
space and most of the other spaces (e-g., spheres, tori, surfaces) which appear 
most frequently, and yet its "differential" and "integral" structure can be 
studied in a unified way. 

The basic idea in differential geometry is to study geometric objects with 
analytic techniques. (The idea of "curvature" of a curve or surface is a 
familiar example.) Often, it is most convenient to mediate between the geometry 
and analysis with algebra. (In fact, it was Descartes, in the 17th century, who 
taught us to do this.) 


2. SOME GENERAL PRINCIPLES OF SYSTEM THEORY 


System theory may be thought of as the study of machines, devices, auto- 
mata, computers, physical objects, etc., from the point of view of input-output 
behavior . The famous "black box" is the key concept — a device which accepts 
certain inputs, subjects them to certain operations which depend on the "state" 
at which the box finds itself, and then puts out an output 


input 


output 


The theory can be developed in this abstract context as a theory of mappings 

<j) : (space of inputs) x (state) (space of outputs) 

However, for the purposes of geometry it is most convenient to work in the 
context of differential equations and continuous time. An input-output system 
for us will be a system of ordinary first order differential equations of the 
form: 
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dx 

dt 


f (x,u) 


( 2 . 1 ) 


y = g(x) 

Here, x = (xj , . . . ,Xj^) ' is a real column vector, i.e., an element of R^. It 
is called the state vector. u = (uj , . . . ,Ujj,) ' £ R™ is the input vector 
y = (yi» — /yp) ' £ R^ is the output vector. x ^ g(x) is observation map. 

The time parameter t runs over 0 £ t < «». We assiame that all data and 
functions are sufficiently smooth (e.g., differentiability class c“) to perform 
all needed operations of differential and integral calculus, and to use freely 
the existence-uniqueness theorems for differential equations. The symbol ’ 
stands for matrix transpose. 

Choosing u as a function u; t u(t) of t, solving (2.1) for t x(t) 

with a value xq of x(t) at t = 0, then applying the observation map g, 
gives an output curve y: t y(t) = g(x(t)). Thus, we obtain a mapping 

(curves in r”^) x R^ -> (curves in rP) 


which determines the system. The input-output relations are the set of pairs 
(u,y) of curves in rP x rP which may appear in this way for some choice of 
Xq. In practical applications we are often given only these input-output 
relations, perhaps in partial, sampled-data, or approximate form. (We put to 
the side, for this work, the whole spectnm of stochastic problems. They also 
have an interesting and important geometric structure, but this is a much less 
well developed subject.) 

This is the open loop description of a system. The alternative is the 
closed loop. A feedback strategy is a map F: X u. The solutions 

t -»■ (u(t) , x(t) ; y(t) ) 

of (2.1) such that 


u(t) = F(x(t)) 

are then defined. They are determined as solutions of the differential equations 


dx 

dt 


f tx,F(x) ) 


( 2 . 2 ) 


Of particular importance are the feedback strategies vdiich are stable in the 
sense that the system of differential equations (2.2) is stable, in one of 
the senses which are customary in differential equation theory. 
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A basic Idea in s/sbem theory is to consider a collection, 2, of systems and 
a corresponding collection 10 of input-output relations. An important case 
is that where E depends on a finite niomber of parameters; call them 


A — CA ^#.../A^5 . 


For example, E might be defined by differential equations of the following 
form: 


~ = f(x,u;A) 
dt 

y = g(x;A) 


(2.3) 


Now, two different values A, A' of the parameters might give the same input- 
output relations. Thus, the corresponding input— output relations might depend 
on fewer parameters. 


The basic object of study in "Geometric" System Theory 
are parameterized families of systems and input-output 
relations. 


It is in this study of "dependence on parameters" that modern differential 
geometry — based as it is on differential manifold theory — is so much more 
powerful than the traditional methodology. Every problem in System Theory has 
two aspects: First, study solutions for a single system, i.e., for a fixed 
value of the parameters, then see how this relation varies when the parameters 
are varied. 

For oxar purposes, it is convenient to focus on three main problems: 

A) The Identif-ioation Problem. Given a set of input -output relations, 
find a set of systems which realize these relations. Often it is not possible 
in problems with parameters; singularities are encoxihtered . The goal is then 
to study how to avoid the singularities or to live with them. For nonlinear 
systems very little that is of great practical use is known about this problem. 

B) The Obsern^er -Compensator Problem. Given a system of form (2.1), 
design another system 


clx ^ 

^ = f(x,u,y) 

y = g(x) , 
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where inputs are the inputs and/or outputs of (2.1), which perform in certain 
prescribed ways. For example, the "observer-tr acker" problems require that y 
and X live on the same space, and that 

lim (x(t) -y(t)) =0 

t ^ 00 

C) The Classic Regulator-Servomedhanism-Optimal Control Problems, i will 
not go into these in any detail — see the Treatises by Anderson-Moore [16] and 
Wonham [19] . Here, the traditional theory only is developed in the context of 
linear systems-, about everything remains to be done for the nonlinear cases. 

Each of these problems also has a "hierarchical" structure suggested by 
the wide spectrum of applications. I have mentioned the dependence-on-parameters 
as a "higher" problem than the fixed parameters one. Even "higher" than this 
are the adaiptive and decentralized-large scale aspects, on which there has been 
much work (most of it inconclusive ) in the last twenty years. The adaptive 
problem is probably the most interesting from the philosophical- social point 
of view (e.g., cybernetics, learning theory), as well as having wide practical 
ramifications ( automata, ^ self-tuning machines, robotics, artificial intelli- 
gence, etc.). Roughly, this involves a system with unknown, perhaps changing 
parameters, interaction with some outside system which can sample parts of the 
input and output. On the basis of these samples, it is desired — preferably by 
an iterative process — to estimate the parameters to a first approximation, 
perhaps then perform certain operations and adjustments, then measure and 
identify again, etc. hopefully converging on something useful. (There is some 
analogy with the theory of "measurement" in quantum mechanics; one must consider 
not only the system in isolation, but in interaction with certain measuring 
apparatus . ) 

Having as complete as possible a theory of systems depending on parameters 
is a necessary condition for a satisfactory adaptive control theory. There 
has recently been important progress in this area by Feuer and Morse [24] , 
Goodwin, Ramadge and Caines [25], involving the analytical and stochastic, 
discrete-time side of system theory — it may be expected that analogous results 
may be obtained using geometric ideas for continuous time systems. 

Although these concepts obviously have a wide range and validity, carrying 
them out in an appropriate mathematical context is a slow business which 
requires, formidable mathematical experience and expertise. In other areas of 
science, it is possible to make use of experiment to get around mathematical 
difficulties. Thanks to computers, the analogue of "experimentation" is 
possible in system Theory, but is not as completely satisfactory as in the tradi- 
tional physical sciences. After all, if one spends several million dollars 
simulating an aircraft control system, it is of no particular use in the study 
of the US economy, although in principle they might involve much the same sort of 
system-theoretic structure, whereas data one collects about most basic physical 
systems is of wide utility in many other contexts. The difference lies in the 
role that "physical laws" play in the two disciplines! Of course, the ideal 
would be to have a close interactive relation between advances in theory — which 
are eventually going to be highly mathematical! — and efforts to carry out 
useful modelling and calculation. 
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As in mathematical physics, the precise, but nonlinear, system equations 
must be linearized for any hope of tracteibility. Although in fxmdamental 
physics we are beginning to reach situations where the essential "nonlinearity” 
cannot be ignored, in System Theory — a much younger discipline — we are still 
in the linear stage, in fact, the theory of linear system, when approached 
from this general point of view, presents enough challenges to the mathematician. 
As I hope to show in this report, the rich geometric structure of the linear 
systems is still relatively unexplored! This is especially marked when one tries 
to expand known linear theory to cover distributed parameter and two-dimensional 
problems. 


3. NOTIONS OF LINEAR SYSTEM THEORY 


From the general point of view of Section 2, a Zineccc system is one of form 
(2.1) which is linear in x and u, i.e., of the form: 


Ax + Bu 


where A,B,C are matrices of appropriate size. However, it is more in tune 
with modern differential geometry to use, instead of the classical and more 
"concrete" matrix theory, the theory of finite dimensional vector spaces. 
(Notice that Wonham' s treatise [28] is written from this point of view. This 
formalism lends itself to a greater precision of thought and statement than 
classical matrix theory, although it is, of course, completely equivalent to 
it. Another motivation is that it is better suited to application of power- 
ful group-theoretic ideas . ) 

Thus, let X belong to a vector space X, called state space; u £ U, 
controt space; y G Y, output space. if V,W are vector spaces., L(V,W) 
denote the vector space of linear maps V ^ W. (Thus, if V = R^, W = r”*, 
L(V,W) is the space of m><n matrices.) 

The system (3.1) is then determined by the tuple (A,B,C) with 
A G L(X,X); B G L(U,X); C G L(X,Y) 

Let L be the space of all systems of form (3.1) . As a space, it is just the 
Cartesian product L (X, X) x l (U, X) x l (X, Y) , 

The structural properties of system (2.1), called controlZabitity and 
obsewabittty are important; it will be assumed that the reader is familiar 
with them. (For linear systems, see any of the treatises cited above. For 
nonlinear systems, see [4].) The following results are well-known: 


dx _ 
dt 

y = 
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The system (3.1) is aontrotlable iff 

X = B(U) + AB(U) + a\(U) + ••• (3.2) 

It is observable iff 


0 = kernel (C) PI kernel (CA) f) — (3.3) 

To the system (3.1) assign the impVuse response 
tA 

t Ce B (3.4) 

or, its Laplace transform 

T(s) = C(s-A)“^B , (3.5) 

which is called the transfer funation or frequency response. 

The importance of these mathematical objects is that two linear systems, with 
the same input and output spaces, have the same input-output relations if and 
only if their transfer functions are the same. 

T(.s) admits a Laurent series in s: 



The Hankel data is: 

H = (CB, CBA, . . . , Ca’^B, . . . ) (3.6) 

Here is a basic theorem about linear systems. 

Theorem 3.1 . Suppose given one linear system (3.1) which is controllable 
and observable. Let 


dx 

dt 


Ax + Bu 


y = Cx 


(3.7) 
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be another linear system which has the same input-output data as (3.1). Then, 
there is an invertible linear map X ->• X which takes the system (3.7) into 
the system (3.1). This irtplies that (A,B,C) is related to (A,B,C) via the 
following formulas; 



(3.8) 


These formulas can be interpreted in an interesting group-theoretic way. 
Let GL(X) denote the group of all invertible linear maps: X ^ X. (Thus 
if X = R^, GL(X) is the group of all nxn real matrices of nonzero deter- 
minant. in Lie group theory, this is denoted as GL(n,R). Thus, GL(X) is 
isomorphic to GL(n,R), where n is the dimension of the vector space X.) 

Let Eqo the subset of all 


(A,B,C) G L(X,X) X L(U,X) X l(X,Y) 


such that the system (3.1) is controllable and observable. L(X,X) xl(U,X) x l(X,Y) 
is a vector space (of dimension n^ + nm+np, where n = dimension of states, 
m = dimension of inputs, p = dimension of outputs) and is an open subset 

of this vector space. Formulas (3.8) define a transformation group action of 
GL(X) on 


Remark . Eere it is useful to keep in mind some general concepts concerning 
Lie groups acting as transformation groups acting on manifolds, which will be 
briefly reviewed. On page 3, the notion of "differentiable manifold" has been 
recalled. 

Groups are encountered as algebraic structures. (By definition, a group is 
a set G, together with a map G x G ->■ G satisfying certain natural rules.) 
Groups with the "local Euclidean" structures of manifolds (so that they can be 
studied with the methods of calculus) are common. They are called Lie groups 
(after Sophus Lie, a 19th century mathematician who first isolated and studied 
them. He and Elie Cartan, a French mathematician who worked from 1890-1950, are 
the Mozart and Beethoven of the subject.) Precisely, they are sets G with a 
manifold structure and a group multiplication operation G x g G which is a 
differentiable map. 

Let Z be a manifold, G a Lie group. An aation of G on Z is a 
mapping G x z Z denoted as (g,z) ^gz, such that 


9 




9 ^( 92 ^) = ( 9 ^ 92 )^ 

for g^,g^ € G, z e Z 

Iz = z 

where " 1 " is the unit element of G 
The ovbits are the subsets of Z of the form 


Gz , 

i.e., the set of points of Z which can be reached by applying all operations 
of G to one point of Z . If 

Z r 

G - {g G G: gz = z} , 


G^ is called the stability or isotropy s\abgroup of G at z. The orbit Gz 

is then identified with the coset space G/G^. The action of G is said to be 
sinpte if G^ = identity, for all z E Z. 

A space W together with a map tt: Z W is said to be an orbit space 
for the action of G if the following condition is satisfied: 

For each w E W, the fiber it ^ (w) is equal to 
an orbit of G. 

We often denote this orbit space as G\Z. 

Here is the motivation for this terminology. Let us say that two points 
zi,Z2 are equivalent (under the action of G) if there is a g E G such that 
gzi = Z2- This defines what is called an "equivalence relation". A subset S 
of Z is called an equivalence class if, whenever z\,Z2 £ S, z\ and Z2 are 
equivalent, and if z E S, all which are equivalent to z also lie in S. 

One can now construct a new set, whose elements are precisely the equivalence 
classes; it is called the quotient by the equivalence relation. In this group- 
action case, the equivalence classes are the orbits, and G acts on the left, 
hence it is appropriate to denote it as "G\Z", read "quotient of Z by G, 
acting on the left". The reader should also be aware of the notation G/H for 
the coset space of a group G by a subgroup H. This is an orbit space for H 
acting on the right: (g,h) ^ gh“^) . 

Return to system theory. The orbit space 


E = GL(X) \ E 

CO 


( 3 . 9 ) 
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can be identified with either the transfer function T(s) (which is a rational 
fimction of the complex variable s, with values in L(U,Y)) or the Hankel 
data H. Each of these objects is an element of an infinite dimensional space. 

One can prove that the action of GL(X) in is simple, i.e., the 

isotropy subgroup of each point of E^q is the identity. Further, one can 
prove that this orbit space is a manifold and that the natviral projection map 
^co ^ is a submersion. One can also prove that this manifold does not have 
a single, global coordinate system, nor is any relatively simple and amenable 
description of it in terms of algebraic equations_known. Further, the map 
E^,q -> E is a principal fiber bundle, with base E and structure group GL(X) . 
(See [27].) It is known that this bundle has non-trivial topological invari- 
ants. 


The "identification problem" can now be stated in the following way: 

Given a parameter space. A, and a map (|) ; A ^CO' 
let <}) : A E be the map which assigns to each X € A 
the input-output data of the system (|){X) . Knowing 
only find an algorithm to lift $ to a map 


Thus, the first mathematical question is: Given a map A^ E, does a lift- 

ing map a: A E^.q exist. More important, does it have appropriate stability, 
smoothness and robustness properties so that approximations constructed in 
conditions of practice (e.g., on a computer) can be used with some confidence. 
(Mathematically, this is a question of "struct\u:al stability" — do mappings 
3: A E which are in some sense close to (j) have a lifting which is close 
to the lifting of ((> . ) 

In any case, the topological fiber bundle (^co' GL(X)) must inevitably 
play a role in identification theory. (Much the same sort of questions arise 
in stochastic identification theory, which is, in fact, the topic which is more 
often encountered in practice.) 

As an example, here is a description of E in the simplest case, namely: 
dim X = 2 ; dim U = dim Y = 1 


The transfer function T(s) is then a rational function of the form 


T(s) 


bo + bj_s 


s + a^s + a^ 


(3.10) 


Note that E has dimension 4 + 2 + 2 = 8. GL(X) has dimension four. Hence, 
E has dimension 8-4 = 4. Note that T(s) also has four free parameters, 
ao ,ai ,bo ,bi . 


However, not all of these parameters in T(s) correspond to controllable- 
observable systems with state space dimension two. We must throw away the 
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T(s) whose munerator and denominator have a common root. The roots of all 
denominators are 


s 






Thus, can be identified with the subset of {aQ ,aj ,bQ ,b^ ) -space, 

i . e - , , such that 


b„ + b. 


— a ± w a^ — 4a a 
1 f 1 0 1 


0 


(3.11) 


R. Brockett and V. Krishnaprasad [20,29] have studied diverse geometric and 
topological properties of the orbit space S in the case of single input- 
output systems . 


4. STATE FEEDBACK AND LUENBERGER OBSERVERS OF LINEAR SYSTEMS 


Continue to focus on the space of linear systems (3.1) with given input, 
state and output spaces. We have seen that one group acts on this space, GL(x), 
the group of ohanges of basis in state space . We have seen that the action of 
this group, particularly the structure of its orbit space, is a pertinent 
question in an important practical problem — the identification problem. We 
will now show that there is different group action associated with another 
important practical topic, feedback. 

Consider an open- loop ...solution 


t (x(t) ,u(t) ,y(t)) 


of the system 

dx 

dt 

y 

Let F: X U 
t -> 


= Ax + Bu 
= Cx 

be a linear map. 
(x,u,y) (t) 


Use it to construct another curve 


with 


(4.1) 
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X = x; y = y; u(t) = u(t) + Fx(t) 

Then, 

dx ^ /V 

•T— = Ax + B(u-Fx) 

at 

= (A - BF)x + Bu 

A ^ 

y = Cx 

The curve (u,x,y) is then an open loop solution of the system defined by the 
triple (A,B,C) , with: 

A = A - BF 

B = B (4.2) 

C = C 

For each F, we obtain a linear transformation 
I. ^ I. 


This map is called the (state) feedback transformation. As F varies over 
L(X,U), it defines a group transformation on E called the (state) feedback 
group. It is an abelian group, since the result of applying two feedbacks 
F,F' successively is to obtain the feedback F + F'. 


A key question is then to describe when two systems differ by feedback. 

We shall investigate this point in the next section by the introduction of a 
powerful algebraic theory, the Kronecker pencil theory. 

A related question is to construct Luenberger observers (or deterministic 
state estimates) for the linear system (4.1). Such an object may be constructed 
using the following chain of reasoning. Let us consider systems with the state 
space as (4.1), but whose input space is the direct sum of the input and output 
spaces of (4.1). Such a system would have the following form: 


dx 

dt 


A ^ A 

Ax + Bu + 


A 


Ky 


(4.3) 


Since we are only interested in states, there is no need to put outputs into 
(4.3). Let us consider an open loop solution (x,u,t) of (4.1), and feed its 
inputs and outputs into the system (4.3). Set: 

e(t) = x(t) - x(t) . (4.4) 
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Consider e as an "error", which we would like to drive to zero. This will 
be done by arranging things so that e satisfies a I'ineav differential equation 
which is asymptotically stable, i.e., such that all solutions go to zero- 

There are two questions here; one algebraic, the other analytic. Suppose 
that e(t) satisfies an equation of the following form: 


^ = ae , (4.5) 

at 

with a £ L(X,X). Let us work out the algebraic equations which must be 
satisfied: 

de _ dx _ d^^ 

dt dt dt 

= Ax + Bu - Ax - Bu - Ky 

= Ax + Bu - A(x - e) - Bu - KCx 


Thus (4.5) requires that the following conditions be satisfied: 


ft 

B = B 

A = A - KC (4.6) 

ct = A - KC 

These formulas again define a transformation from one system to the other, 
depending on the linear map K. Notice that K plays a dual role to that of F 
used for "feedback", i.e., one transforms K into an F for another system by 
dual operation of linear maps (or matrix transpose) . 

In order that the system (4.3) serve as an "observer" or "state estimator" 
for the system (4.1), it is necessary that all solutions (4.5) go to zero. This 
requires that: 


All eigenvalues of a = A - KC have 
negative real parts. 


Since eigenvalues are invariant under duality operation, this is the same thing 
as requiring that a* = A* - C*K* be stable: hence, K* serves as a stabiliz- 

i.ng feedback for the dual system. 

Clyde Martin and I have suggested 126) a method for finding such a K that 
fits in well with the point of view of this paper. Given A,B,C, we construct 
the following map 
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of GL(X) X L(Y,X) -> L(X,X) 

We proved that if the system (4.1) is observable, then the map <f>(A,B,C) ^ 

submersion map on some non-empty open subset of <GL(X) xl(Y,X). ' ’ 

Remark- Here is more differential-geometric background. Let ir: Z W 
be a map between manifolds with the dimension of Z greater than or equal to 
the dimension of W. For z £ Z, let Zg be the tangent space to Z at z. 
Let linear map on tangent vectors induced by ir- z is 

said to be a regular? point if said to be a submersion 

if each point is a regular point. (Think of "siibmersion" as "projection".) 

This suggests the "robustness" property that is needed for handling depen- 
dence on parameters and related adaptive control questions. For example, if 
(A,B,C) depend on parameters one might want to find a K(A) depending on 

X, so that the error equations (4.5) had a desired rate of convergence to zero 
The implicit function theorem offers a useful tool for deciding when this is 
possible. Suppose again that ir: Z ->■ W is a map between manifolds and that A 
is another manifold (which serves as the "parameters") . Let a: X ^ w(X) be 
a map A -> W. Our basic problem is to find a map 3= X -»■ z(X) such that 
iTz(X) = w(X) . This means that the following diagram of maps is "commutative" 



W 


3 is said to be a "lifting" of a. The "local" existence of such a lifting 
map can be decided using the implicit function theorem if tt is a submersion. 
The question of its global existence, and the situation where tt is not 
everywhere a submersion, i.e., has "singularities", is more complicated and 
must be attacked with more powerful tools from topology and the theory of 
singularities of mappings. 

Another interesting direction of research is the generalization of this 
argument to cover more complicated situations or to take advantage of different 
experimental data. Here is one possibility and an illustration. 

Suppose that t -> x(t) is a curve in X with 
e = X - X , 

such that e satisfies an equation of the following form: 


— 5 - = ae . (4.7) 

dt 


15 



Now, from (4.1) 


d^x 


dt 


2 


'"at 


du 

dt 


du 

= A(Ax + Bu) + B 

dt 


dt^ 


A^x 


+ ABu + B 


du 

dt 




A 

= ax - ax 


We can satisfy this relation if x satisfies an equation of the following 
type: 

— ^ = ax - B - ABu - Ky , (4.8) 

dt 

with 


a 



(4.9) 


Again, we have the question of choosing K so that equation (4.7) is asymptoti- 
cally stable. There are clearly many schemes like this one to find asymptotic 
state estimators for a linear system. It is a major open problem to usefully 
generalize these schemes to cover nonlinear systems. 


5. THE KRONECKER THEORY OF PENCILS OF LINEAR MAPS 


Although the Kronecker theory is one of the highlights of 19th century 
algebra and invariant theory, it is very difficult to find complete expositions 
of it. The only one in recent times is in Gantmacher's magnificent Theory of 
Matp-ioes [6], but even that becomes obscure at a key place. In [7] l trans- 
lated Gantmacher's ideas into coordinate-free language, and (I believe) made it 
clearer, but did not fill in that expository gap. Here I want to present it in 
a different way, which is much more closely attuned to System Theory. 

Before plunging into the abstract vector-space treatment of Kronecker 's 
ideas, which we will find most useful for system theory, it is useful to recall 
the more classical way that the theory developed. in terms of matrices whose 
entries are polynomials. (See Gantnacher [6] and Rosenbrock [18] for more 
details.) Let M(s) and Mi(s) be mxn matrices, whose coefficients are 
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polynomials in a variable s. Let us say that Mi(s) is equivalent to M(s) 

(in Weierstrass's., sense) if there are in x m and nxn matrices A(s) , B(s) with 
polynomial entries, such that 

M^(s) = A(s)M(s)B(s) (5.1) 

det A(s) “ 1 = det B(s) 

The "invariants" of matrices under this equivalence relation are the so-called 
elementaipy divisops, i.e., the diagonal entries after the matrix is put into a 
"canonical form" after elementary row and column operations. Let us say that 
Mx(s) and M(s) are equivalent in the sense of Kroneaker if a relation (5.1) 
holds, with A and B independent of s. M(s) is called a pencil of matvioes 
if each polynomial occurring in its entries is at most first degree. A remark- 
able theorem, proved in fa;ct by Kronecker himself, is that two pencil matrices 
are Kronecker-equi valent if and only if they are Weier stress equivalent. The 
"Kronecker pencil theory", which we describe briefly below, then provides a 
constructive algopithm for describing these equivalence classes. Such "oencils" 
of matrices are often encountered in system theory, and the Kronecker theory is 
the basic structural fact in many applications. The term "pencil" comes from 
geometry — the coefficients of matrices, when set equal to constants, define 
straight lines, since they are linear in s. Think of a "pencil" as a geometric 
collection of straight lines. 

Now return to abstract vector spaces. Let V,W, . . . be finite dimensional 
vector spaces over the real or complex numbers as field of scalars. (In fact, 
everything carries over to arbitrary scalar fields.) L(V,W) denotes the vector 
space of linear maps 

a; V W 

L(V) denotes L(V,V). GL (V) denotes the group of AG L(V) such that A ^ 
exists. The product group GL(W) xgL(V) acts on L(V,W) : 

for g^ G GL(W), g^ £ GL(V) 


The orbits of this group are readily described; They are the elements of 
L(V,W) of a given rank and nullity. The quantities dimCkernel a), dim(.a(V)) 
33^6 the invariants of this action and label the orbits. Thus, we are dealing 
with a transformation group with only a finite number of orbits and with an 
algorithm to construct the invariants of the group action. It is a very 
difficult problem to do this for geneTXxl transformation group actions — it is 
unknown for all but a few cases. (For example, it was a major labor of 19th 
century invariant theory to do this in part for algebraic actions of SL(2,C), 
the group of 2x2 complex matrices of determinant one. Thus, invariant 
theory "died" before it could be extended to other groups, and to this day it 
has barely begun to revive.) 
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The Kronecker pencil theory fits in very well with this point of view. 
Consider GL(W) x GL(V) acting on L{V,W) x L(V,W) as follows: 


for £ GL (W) , 


(giaog'\giC»ig2^) 


g^ e GL(V), 


e L(V,W) 


The Kronecker theory is an algorithm for computing orbits, invariants, and 
"canonical forms" for orbits for this particular transformation group action. 
This is one of the very rare "invariant theoretic" situations which can be 
described in this way- The theory of "quivers" due to Gabriel and Gelfand [8] 
offers one general insight into this. 

Suppose ao»a][ £ L(V,W) . A pair (V’ cv, cw) of linear subspaces is 
said to reduce (ag,a^) if 

aQ,a^(V) c w' . 


Such a pair is said to be a KiPoneo'keT veduQt'ion if there is a map 
g: V* ^ V 

such that the following conditions are satisfied: 

= - HqB (5.2) 

B is nilpotent , (5.3) 

i.e., 3^ = 0 for n sufficiently large 

V = kernel (Q q) + S(V') 

Such a triple (V',W; 3) is said to be an etementary Kronecker reduction if 
the following additional conditions are satisfied: 

dim (kernel a^) = 1 - (5.4) 


Here is the motivation for this definition in terms of the classical 
material. Consider the penait of linear maps 

a(s)=aQ + sa^- (5.5) 
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(Thus, the term "pencil" refers to the fact that a(.s) 
in s.) Consider vector-valued polynomials 


is a Vvneca? polynomial 


s 


v^s + 


n 

+ s V 

n 


in 

^0 % ^ ^ 


such that 


a (s) (v(s) ) = 0 


(5.6) 


that 


Theorem 5.1 . (Kronecker) . If a(s) is a singular pencil in the sense 


kernel a ( s) 0 

for all s , 


but 


kernel (a^) H kernel (a^) = (0) 


and if v(s) is chosen to be the mini-mal degree polynomial such that (5.6) is 
satisfied, then the elements Vf],...,v,^ are linearly independent. Further, if 
V is a linear subspace of V whose basis is then 

aQ(V') = a^(V') = W 


There are linear subspaces V" c V, W" c: w such that 


V = V' + V" 

W = W + W" 

a (s) (V") t= w" , for all s 


(5.7) 


Thus, the process can be applied again to a (s) acting on V". □ 

In order to see where the nilpotent map g comes in, let us make (5.6) 
explicit: 

0 = (a,, + sa^ ) (v + sv + ■ • • + s^V ) 

0 10 1 n 


n 2 

- a„v^ + sa^v, + • • • + s a v + sci, v^ + s oi^v + • 
00 01 On 10 11 
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Equating the coefficients of powers of s to zero gives the following relations 

= 0 (5.8) 


a, V =0 

1 n 


(5.9) 


“o’'i ^ “I'^O ' ° 


Vz * “l'’l ' ° 


(5.10) 


a V + a V = 0 

On 1 n-1 

Let V be the linear subspace of V spanned by the vectors vo,...,Vj^. Define 
0: V V' 

as follows: 


3 (Vq) 


0(v^) 


= V, 


(5.11) 


0(vj 


= 0 


Remark . g is the map which operator-theorists call the shift. 
an important underlying role in many systems-theoretic areas. 

We can now rewrite relations (5.10) as: 


It plays 


“i''o ■ ■ “o®''o 


“l''l ' ' “oS"! 


(5.12) 


a.v = - a^gv , 

1 n-1 0 n-1 
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(5.9) mecins that 




a„pv 
0 n 


Thus, relations (5,9)- (5.10) 

“l^' "" ' 

for all v' £ V 


= 0 

can be summarized in the following relation: 

(5.13) 


Only relation (5.8), i.e., vg £ Icernel ag is not implied by this relation. 

Notice that we can write the polynomial map s v(s), which satisfies 
(5.6) in a more convenient basis- independent form.:. 


v(s) 


(1 - sp) ^(V|^) 


(5.14) 


(This formula might be useful for generalization to infinite dimensional 
situations. ) 

We can now continue to split up V" in this way until there are no n>ore 
polynomials with coefficients in V which annihilate ct(s). Then we work on 
the dual situation. Finally, we end up with a non-singular pencil. This process 
can be described in an overall way as follows . 

Theorem 5 . 2 (Kronecker) . Let agf“l= V ^ W be a pair of linear maps. 

V and W can be split up into direct sums 


V = V © V" @ V- 
W = W' © W" © W"' 


such that: 


aQ/Oi (V , V",V"') <= W',W”,W"' 


i.e., each of the pairs of linear spaces reduces the pairs (ag,a]^) of linear 
maps. Further, there are nilpotent linear maps 

P' ; V V 
P": W" ^ W" 

such that the following conditions are satisfied: 
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on V 


(5.15) 


= B"a^ on V" (5.16) 

V = (kernel a^) © B(V’) (5.17) 

aQ(V') = W (5.18) 

W" = aQ(V") + kernel (5.19) 


For all but a finite number of s's, aQ + sa^ 

is an isoitiorphi.sm v"'-+ W"', i.e., this (5.20) 

component of the pencil is "regular" . □ 


This marvelous theorem gives a complete description of the algebraic 
structure of pairs of linear maps. As I mentioned above, it is a very rare 
event in invariant theory v/hen such a complete story is available. J.uckily, 
many of the situations encountered in the theory of linear, t ime- independent 
one-dimensional input-output systems involve pairs of maps. We v/ill now look 
at another, more geometric, way of analyzing the structure of pairs of linear 
maps. 


6. THE VECTOR BUNDLE ASSOCIATED WITH PAIRS OF LINEAR MAPS 


Continue with V,W as finite dimensional vector spaces, and with c«o,ai 
as a pair of linear maps V ^ W. Suppose also that the complex numbers (E are 
the field of scalars. iC^ denotes the set of pairs (sg,S)^) of complex numbers, 
denotes the multipl^icative group of nonzero complex numbers. acts on 

as follows: ((C^" denotes the pairs (sg,si) Vfith either Sq or 

Si 7^ 0, ) 


A(s^,s^) 



As^) 


The orbit space is Pi(iC), the one (complex) dimensional projective 

space, (Alternate names are t.he' projective line and the Rietmnn sphere,) 


For each s = (sq,si) e 


let 


v(s) 


e V: 




0 


1 

I * 


Note that 
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V(As) = V(s) 
for X e 

Thus, the assignment 
s V(s) 


of a complex vector space to each s G is constant on the orbits of 

hence, to each orbit irCs) G Pi(ffi) one can assign the vector space V{s) . This 

the set of 
the pair (oq , ai) . 

It is hotomorphia , i.e., defined by holomorphic functions. Of course, it may be 
singular, i.e., the dimension of the fibers may vary. If it is non-singluar , a 
theorem of Grothendieck [12] gives the precise structure, a direct Sum of complex 
tine bundles. (a complex line bundle is a complex vector bundle v/ith one- 
dimensional complex vector space fibers.) 

Since the "vector bundle" concept is a central one in modern differential 
geometry, a few words about the general setting are appropriate. A vector space 
is a set which has two compatible algebraic structures: an addition and a scalar 

multiplication by elements of a field of scalars (the real or complex numbers, in 
the typical situations) . Let S be a space. Suppose that for each s G S, is 
given a vector space V(s) . This forms a space E, v;hich consists of the pairs 
(s,v) , with s G S, v G V(s) . Assign to (s,v) G E the point s G S; this 
defines a map tt: E S . The fibers of v, i.e., the sets , are then 

the vector spaces V(s) . E is a "bundle" of vectors, in the sense that to each 
point s G S one sees attached the collection of vectors in V(s) . 

For certain applications, particularly in algebraic geometry [23], it is 
useful to put additional structures on such vector bundles. Suppose that S is 
a complex analytic manifold, in the sense that local coordinates for S can be 
chosen as complex numbers, with the maps which interrelate two such coordinate 
systems given by holomorphic functions. (Recall that, in complex function theory, 
a "holomorphic" ( e "complex analytic") function is one defined on open subsets 
of complex Euclidean space whose derivatives exist at each point.) In addition, 
suppose that the field of scalars for the vector spaces V(s) are the complex 
numbers. Then, we say that (E,n,S) is a oo'nrplex analytic or holomorphic vector 
bundle if E can be made into a complex analytic manifold such that ir is a 
complex analytic map. 

For this special way of defining the bundle using the pair (ao,ai) of 
linear maps, another, purely algebraic, analysis of the bundle is available 
using the Kronecker theory outlined in previous sections. (Grothendieck ' s 
theorem uses "transcendental", non-algebraic tools.) For example, suppose the 
first part of the Kronecker reduction process succeeds in filling up V 
completely: 

V" = V"' = (0) 


assignment defines a complex vector bundle E, [10,11,32] namely, 
pairs (ir(s> , V(s) ) , s G It is called the kernel bundle of 


Then, the following conditions are satisfied; 
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ct 6 
0 


kernel © g(V) = V 


with 3: V -»• V a nitpotent map. For s = (sq,sj) G , 


V{s) = set of all V G V such that 




= cij^Cs^ + s^3) (v) 


= 0 


(Sq + s^3)(v) G kernel 


( 6 . 1 ) 


Now, if Sjj 7 ^ 0, 


V(s) = I V G V: (l + — 3)(v) G kernel 


“4 


-1 

®1 \ 

1 + — 3 (kernel a_) 

=0 / “ 


For S( = 0, 


V(s) = V G V: 3(v) G kernel a 


and in view of (6.1), 


V(s) = {v G V: 3 (v) = 0} 


= kernel 3 


Using (6.1) again to compute the dimension of kernel 3^ 
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dim V = dim kernel (Oq) + dim B(V) 


= dim kernel (g^) + dim V - dim (.kernel g) 


or 


dim kernel (g) = dim kernel (Oq) • (6»2) 

This proves one of my main results. 

Theorem 6.1 . If aQ,a^: V -»■ W are lineeir maps with only the "singular" 
Kronecker components present (i.e., the "cosingular" and "regular" components 
vanish) , then the kernel bundle is non-singular. o 

We can use the nilpotent map g to exhibit the Grothendieck decomposition 
of the kernel bundle of (ao,ai) into line bundles. For Vq € kernel uq, 
(sofS^) € , with sg ^ 0, set: 





(6.3) 


This defines a cross-section of E over the open subset 
- (the "point at infinity") 


The one-dimensional linear singular space of V(sg,Sj) spanned by the value 
VQ(sg,Sj) of the cross-section can be continued to the point at infinity of 
(I) , since 


S V ( s , s ) 
0 0 0 ' 1 



+ • 


+ (- 1 ) 


n-1 



n-1 


, T,n-1 n-1 oU-l, . 

(-1) s^ g (Vq) 


as Sg ->- 0. (n = least integer such that g^vg = 0.) We obtain in this way 
a one dimensional sub-bundle of E- As vg varies over a basis of kernel ug / 
we obtain in this way a "concrete" version of the Grothendieck decomposition. 
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7. THE HOLOMORPHIC VECTOR BUNDLE INVARIANTS OF LINEAR, 
TIME- INVARIANT INPUT SYSTEMS 


Next we can turn to material more familiar to system theorists. Consider an 
input system 


dx 

dt 


Ax + Bu 


Here (changing notation from that of previous sections so that it conforms with 
the more-or-less standard system-theory notation) , 


X E X = state space 
u £ U E input space 


X,U are finite dimensional vector spaces; A: X ->■ X, B; U ->■ X are linear maps. 
Set: 


V = XX U 
W = X 

Write an element v of V as a partitioned column vector: 



Define linear maps 

“o'“l* ^ ” 


as follows: 


Oi 


i 



= X 



(Ax + Bu) 


Thus Equations (7.1) take the following form; 


a 


1 




= 0 


(7.2) 
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The pair (ag/ai) of linear maps then define a vector bundle E over P^{<E), 
as described in previous sections. 


For 


Thus , if 


s e o:. 


E(s) = j 

(u)“ “o(;)t-l(u)| 

= ( 
1 

^ ^ j : SX - (Ax + Bu) = 0 1 

_ 1 
" 1 

( ^ j : X = (s - A) ^ Bu 1 

1/s is not 

an eigenvalue of A, 


dim E(s) = 


dim U 


E(“) = 


(u)= Ax + Bu 


Theorem 7,1 . If the input system (7.1) is controllable, then s. ->■ E(s) 
defines a non-singular vector bundle over P]^ (I) , whose fibers are equal to the 
dimension of U. a 

This is proved in [9] . 

We can immediately use this to infer some important qualitative facts about 
controllable systems (7.1). The complex vector bundle is, following Grothendieck 
[12], a direct sum of line bundles. The degrees of these line bundles (i.e., 
their Chern classes evaluated on the generator of two-dimensional integral homo- 
logy of P^ ((D) ) are then significant numbers for system theory. (For example, 
they would remain invariant under both aomptex anatyt-ic deformations and feedback 
transformations of the systems (7.1) .) Martin and I show [ 9] that they are, in 
fact, the numbers in the Brunovsky canonical form. Their sum is then the first 
Chern class of the vector bundle E evaluated on the generator of H 2 (P 2 » Z) . 

It is the MaamilZan degree, i.e., the dimension of x. in particular, this shows 
that the Macmillan degree attached to the transfer function T = C(s-A)~^B of 
a controllable, observable system is a topoZogiaat invariant. 

What purpose does this serve? Notice that we have taken the standard 
algebraic structure of linear systems and interpreted it geometrically in a way 
that is much more amenable to dealing with more complicated systems. In [14] 
Martin and I have presented certain preliminary calculations which lead us to 
believe tliat the material does carry over to certain infinite-dimensional and 
time-varying systems. One obstacle to such applications is that the information 


27 



we would need about the classification of holomorphic vector bundles on more 
complicated complex manifolds than Pi((E) is not yet available from the work of 
the pure mathematicians, although it is a topic of intense development among 
algebraic geometers. There are also probably many more significant applied 
problems whose solution is dependent on information about the structure of holo- 
morphic vector bmdles. For example, the classification of holomorphic vector 
bundles on P3 (I) has played a key role in the At iyah- Singer -Ward theory of 
Yang-Mills "instantons", objects which appear naturally in etementaTy pca^'tio'Le 
physios. C. Byrnes has constructed analogous bundles on s >^Pi(<E), with S a 
topological space, as useful gadgets for the study of the delay systems. In 
later sections I will show how certain N-dimensional systems {e.g., the Maxwell 
and Helmholtz equation) lead in a natural way to such vector bundles on higher 
dimensional complex manifolds. 

There is also a possibility of an intermediate classification of systems by 
classifying the stvuotuoe groups of their associated vector bimdles. The struc- 
ture group starts off as GL(m,(C) , where m = dimension of input space. The 
Grothendieck theorem [12] says it can be reduced to the subgroup of diagonal 
matrices. However, it might be already given "by Nature" in such a way that the 
structure group is another subgroup of GL(m,iE) . It appears that the systems 
occurring in circuit theory, analytical mechanics, etc., all have their typical 
and characteristic structure groups; in turn, this structure group can be related 
to properties of the Hankel matrix of the system. For all of these reasons, I 
believe that people interested in applications should, in this case, relax their 
natural skepticism (which I share in principle) about fancy mathematics and 
consider seriously the possibility that the theory of vector bundles gives a 
valuable way of describing unified properties of systems. 


8. FEEDBACK EQUIVALENCE AND THE KRONECKER THEORY OF 
PAIRS OF LINEAR MAPS 


The work that Martin and I have done can be usefully interpreted in the 
language of category-fimctor theory. We attach to the "category" of (linear, 
time- invariant) input systems essentially two "functors". One is geometrio , 
with the category of holomorphic vector biandles, the other is aZgehraio , with 
the category of pairs of linear maps. The reader who is familiar with such 
things will recognize that this is a typical situation in such avante-gairde 
branches of mathematics as algebraic topology and geometry. In these areas it 
has been found that it is precisely in reconciling the "geometric" and "algebraic" 
world view that some of the most significant and useful relations appear. Now, 
from the engineer's point of view, the natural "isomorphisms" of systems are the 
feedback transformations. The natural isomorphisms of pairs of linear maps 

(uQ,a^) e L(V,W) X L(V,W) 

are the action of the group GL(V) x GL(W) . The fact that they are the same 
thing is very worthwhile proving explicitly, because it plays such a fundamental 
role. (If it had been recognized earlier, say in 1970, it would have simplified 
the task of the authors of many papers.) 
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Now, it is obvious that feedback equivalence of systems implies equivalence 
under GL(V) x GL(W) of the corresponding pairs of linear maps. We must deal 
with the converse: Suppose then that 


dx 

dt 


Ax + Bu 


( 8 . 1 ) 


d*x 

dt 


A'x + B'u 


( 8 . 2 ) 


are two linear systems. Set: 


V = X X U = set of pairs ) 



W 

= X 

• 



= X 



%(u) 

= - Ax 

- Bu 


<0 

= X 




= - A' 

X - B ' u 

Let us 

then suppose that 

(Uq fUi) 

(uq , af ) 

, i.e.. 




“o 

^l“0^2 



“i 




g^ e GL(X) 



(8.3) 


(8.4) 


Then, 


e GL(X0U) 




■ 
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Suppose is written in partitioned fotm: 


:(u) = 


11 ^ 2,12 


2,21 "^ 2 , 22 - 


C) 


^ 2 , 11 *-" * 2 , 12 " 
* 2 , 21 * + * 2 , 22 " 


* 2,11 ® * 2 , 12 ^’'*“'’'’' 


* 2,21 * 2,22 


Compare (8.5) and (8.6); 


^2,11^ ^2,12^ = V 


hence 


^1 = 

^ 2,11 


^ 2,12 

= 0 


Plug (8.7) and (8.8) back into (8.4); 

“i* 2 (^) ' * i “ b ( u ) 

or 


= - g^ (Ax + Bu) 


or 


a ' 
0 


SlX 


^^ 2 , 21 ^-^^ 2 , 22 ^ 


( 8 . 6 ) 


(8.7) 


( 8 . 8 ) 
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(Ax + Bu) 


A'g^x 


®' < 32 , 21 ** 92 , 22 “’ = 


or 



A’g 


1 



+ A 


(8.9) 



( 8 . 10 ) 


These formulas can be summarized in the following way. 

Theorem 8.1 . The two input systems (8.1), (8.2) lead to GL(X) x GL(X © U)- 
equivalent pairs of maps if and only if the systems differ by state feedhaoh and 
by change of basts in input and state space. □ 

This result has two sorts of ramifications in system theory. First, it 
enables the state feedback classes to be classified by the Kroneokev pencil 
theoicy. in fact, it was Brunovsky [ 5 ] who did this in a way independent of the 
general Kronecker theory. Second, it indicates a major algebraic difference 
between state feedback on the one hand and output feedback and various sorts of 
"compensators" on the other hand. If the latter sort define a group (which is 
not always clear) , usually the orbits of these groups on systems are not equiva- 
lent to the problem Kronecker solved, i.e., equivalence of pairs of linear maps 
under isomorphisms of domain and range spaces. Problems of enumerating orbits 
are extremely difficult in our present state of knowledge of invariant theory 
if they do not reduce in one form or another to the Kronecker problem or are 
closely related to it. It is then a lucky accident (?) that state feedback is 
governed by the Kronecker invariant theory. 


9. TRANSFER FUNCTIONS AND HOLOMORPHIC VECTOR BUNDLES 
FOR 1-D AND n-D SYSTEMS 


The geometric methods introduced by Martin and me into system theory depend 
on associating with a finite dimensional, time- invariant input-output system a 
vector bundle on the Riemann sphere. Note that the Riemann sphere (the complex 
numbers with its one-point compactif ication) is the only Riemann surface of 
genus zero. This point of view is not really so avante-garde as it might sound, 
since it is very much in the spirit of the one-complex variable methods used by 
electrical engineers in the 1920' s to '40s, (This is the material that was 
supplanted by "state space methods" in the late 1950 's and 1960 ’s.) We believe 
that there is considerable potential for applying these methods (enriched by 
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the modern mathematical research) to more general systems. I will now develop 
material in this direction for systems described by ordinary and partial differ- 
ential equations. 

There is a vast field of engineering practice called the "theory of two- 
dimensional filters", which as yet has no systematization and mathematization 
(as the "state space" theory systematized and mathematized one-dimensional 
filters) . I believe the ideas presented here might put us on the road 
to such a theory. Another interesting possibility is an application of complex 
manifold methods to the systems of pariiaZ differential equations which occur in 
physics. 

My plan is to try to construct holomorphic vector bundles with such systems. 
We shall see that this is a modern a Igebro- geometric version of the traditional 
"transfer function" methods used in system theory and electrical engineering. 

Let us begin with well-known material. Consider a finite dimensional, 
linear, time-invariant input-output system 


— = Ax -(■ Bu 

dt 

y = Cx 


Convert this into an algebraic equation using the Laplace transform 


x(s) = J 
0 


x(t) e dt , 


with zero initial conditions. 


sx(s) - Ax + Bu 


y = Cx 


y = (C(s - A)”^B) u 


T(s) = C(s-A)”^B 


( 9 . 2 ) 


is the transfer funation or frequency response, if u and Y are the linear 
input and output spaces, T is the natural mapping 


C L(U,Y) 
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A more appropriate gadget from the Riemann surface is the sequence 


0 = C(s-A) ^Bs^ds 

n 


(9.3) 


of differential forms. Let us see how they behave at s = oo. set: 


1 

u = ~ 

s 


du = - u ds 


6 = - C 




-n -2 , 
Bu u du 


- C(l- Au)"^B u du 


= - C (1 + Au + A^u^ + • • • ) Bu (Ju . (9.4) 

This shows that the 9^ are mevomorphia diffevent-ioL forms on the Riemann 
sphere. Their residue at u = 0, i.e., s = «•, is especially important; it 

is the "Hankel data". 

residue 0 = - Ca"^ B . (9.5) 


We are especially interested in the inverse Laplace transform 


j T(s)e^^ ds = / nT * (9.6) 

a-i«o n=0 

This formula describes the input— output relations in terms of objects which have 
an a priori meaning in terms of Riemann surfaces, the "Abelian Integrals" 


/ 


9 

n 


It is a well-known and classical idea (in mathematics) that the appropriate way 
to generalize these ideas is to replace the Riemann sphere with other compact 
complex manifolds. 
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In previous sections I have described the relation between system theory 
and Kronecker pencil theory. Here is a more general version. Let us rewrite 
(9.1) as 


— - Ax + Bu = 0 

dt 

y - Cx = 0 

These equations have the form 


(9.7) 


a 


1 dt 




= 0 


(9.8) 


where ag,ai are linear maps: X©U©Y X©Y. 

Thus, the general problem of the theory of linear, time- invariant, finite 
dimensional systems is covered by the following equations 


dv 
“l dt 


= 0 


(9.9) 


where V,W are vector spaces, ag'^1 linear maps V W. Associated with 

this is the pencil 

a(s) = a^s + Uq (9.10) 


of linear maps. 

We can assign to the pencil (9.10) a vector bundle 


tt: E -s- P^CiC) 


E = {(s,v):S€a:, v£V:cx(s)v=0} 

We can now generalize these constructions to cover "distributed parameter" 
systems . 

Let ti,...,tj^ be independent variables; V,W are finite dimensional 
vector spaces. 

a„»a, f • • • /a : v W 
0 1 n 

are linear maps. Consider the differential equations: 
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0 


(9.11) 


3v 

^1 3t, 


+ a 


3v 

n at “o"^ 


Associated with this we have: 


s 

= (s , . . . ,s ) € (n” 
1 n 

(9.12) 

a(s) 

= a,s +---+a s 
11 n n 

(9.13) 

E 

= { (s,v) : a (s) (v) = 0} 

(9.14) 

IT : E 

I 



Tr(s,v) = s 


E should be aompteted to be a veotor bundle over a compact complex analytic 
manifold X on which is embedded as an open subset. How this is to be 

done i-n general will be left open for the moment. Consider some simple examples 
motivated by physics. 

Suppose given two independent variables t]^,t 2 . (In the physical applica- 
tions, they may be two space variables or one space, one time variable.) Denote 
partial derivatives of functions of these variables by subscripts. Consider the 
equation of "Helmholtz" type: 


Vi 


+ y 


^2^2 


+ Ay = 


(9.15) 


A is a constant, u a scalar function of tj,t 2 , u is the "intput", y the 
"output". This can, of course, be converted to an equation of type (9.11) by 
introducing more dependent variables, but it will be (for the moment) more 
convenient to work directly with Equation (9.15). The "transfer function" is 
obviously the following rational function of two complex variables 3 ^, 32 : 


T(Si,S2) 


2 2 

S1 + S2 + X 


Thus , 


(9.16) 


y(ti,t2> 


// 


(si t]+S2t2) 


u(s^,S2) ds^ ds^ 


(9.17) 
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taken over an appropriately chosen two-dimensional submanifold of might 

be a solution of (9.15) . 

Note that (9.15) is invariant under the rotation group SO(2,R) acting in 
R^. This translates into T depending only on sj+sl# as we know ej^licitly, 
as shown from formula (9.17) . 

We now want to consider (9.17) as an integral over a manifold. In order to 
do this, we shall utilize the connection between differential forms and integra- 
tion on manifolds, which is explained in nany references (e.g., [21-23]) . This 
requires that we introduce the exteviov product operator "a" and the exterior 
derivative "d". Thus, the "element of integration" in (9.17) is 

"ds^ds^" = ds^ A ds^ 

(Recall the algebraic rules 

ds^ A ds^ = - ds^ A ds^ 

ds^ A ds^ = 0 


These rules of "Grassmann algebra" provide a convenient algebraization of the 
familiar rules from advanced calculus for manipulating and changing variables in 
multiple integrals.) 

Introduce polar coordinates in the integral (9.17): 


2 

z 


ds^ A ds^ 


2 2 
®1 ^ ^2 


z cos 0 , 


(dz cos 0 - z sin 0 d0) 


z dz A d0 


z sin 0 

A (dz sin 0 + z cos 0 d0) 


Set: 


w = cos 0 
dw = - sin 0 d0 , 

d0 dw 
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ds, A ds_ = 

z 

dw A dz 

1 2 

/l - w^ 



(9.18) 


Let us now suppose that u i-s cl funoti-on of z atone. Then (9.17) can 
be rewritten as: 


y( 


r c z(tiw + 

V‘2> ' Jj^ 


t2»^l-w^ ) 


u(z) 


dw A dz 


+ E /I - w2 



z (t^W + t 2 *^l-w^ ) 


dw 


/l - w2 


zu(z) 

2 , 
z + X 


dz (9.19) 


This formula exhibits the potential "algebro-geometric" nature of the situation. 
We have the algebraic correspondence 




V 


2 2 
=1 =2 


(9.20) 


of <cl . (Note that is is not a "rational map".) The fibers are the orbit 

of the orthogonal group SO (2,3;) . They are the circles 

2 2 2 

^1 ^2 = ^ * 


In order to get a better idea of the algebro-geometric nature of the inte- 
grals (9.19), let us use a power series expansion for the ej^onential function. 
Set; 


y ~ f[ (zw)^ (z /l-w^ ) 
n f in. .J ^ \ ^ 


m 


dw 


/l — w^ z + X 


A u(z) dz (9.21) 


i.e. 


r 
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( 9 . 22 ) 


y(t^,t2> = 


, n . ni 

” 1 1 

E l ^ 

n nlm! 
n,m=0 1 2 


Thus, it might be appropriate to consider the differential forms 

(zw) I z V 1 - ^ 


dz 


n,m 


z"+ X 


0 


n,m 


n m 
^^2 


2 2 , 
Si + S2 + A 


dSiAds2 


( 9 . 23 ) 


( 9 . 24 ) 


as the characteristic "geometric objects" attached to the input-output system 
(9.15) . 

This suggests that we consider separately the differential forms 

^ z /l - j dw 

a = (zw)'^ . (9.25) 

with z considered as a parameter. They are essentially differential forms on 
the Riemann surface whose local variable is w, i.e., the algebraic curve 

2 2 

Wi + «2 = 1 . 


(A Rtemann surface is a complex analytic manifold that can be parameterized by 
one complex variable. Of course, another point of view is to regard 


z (t jw + t2/l-w^ ) 
e 


dw 


1 - w 


as a differential form on this Riemann surface. (Its indefinite integral can 
be written down explicitly in terms of Bessel functions.) 

This simple example clearly gives us much new material to consider for a 
general theory of systems from a complex manifold-algebraic geometric point of 
view. 
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10. VECTOR BUNDLES ON ORBIT SPACES 


We can immediately see a general pattern to the example treated in the 
previous section. 

Let X,Y be spaces, 

(j): X -»■ Y 

a map. Suppose g-Lven a vector bundle 
ir; E -> Y 

For y £ Y, the fiber E(y) = (y) is a vector space. It defines a vector 

bundle 

on X, called the puZl-baoh bundle 

(|) ^(E) = { (x,v) : v £ E(4)(x))} 


There is a commutative diagram 


-1 


(E) 


> E 


X Y 


Consider a given bundle E' on X and a map <j> : X ->■ Y. We can ask 
whether there -Is a bvmdle E on Y such that 

E' = (j)~^(E) . 

One can also ask whether there is an equivalent vector bundle E” on a bundle 
E such that 

E" = (j)”^(E) 

These questions are especially interesting for systems theorists (and physicists!) 
if the following additional structure is put on: 
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G is a transformation group on X. Y = G\ X is the 
orbit space, (j>: X-»-g\x = Y the map which sends 
X £ X into the orbit Gx on which it lies. G acts 
as a group of automorphisms of the vector biindle E'. 

Let us return to the context of Section 9. 


11, VECTOR BUNDLES DEFINED BY SYSTEMS OF PARTIAL 
DIFFERENTIAL EQUATIONS AND LINEAR 
GROUPS OF SYMMETRIES 


Let V,W be complex vector spaces, n an integer, and let <1 q ,ai , . . . ,aj^: 
V -V w be linear maps- We can then construct the system of li.neco' constant 
coefftatent partial equations: 


3v 


3v 

+ a -rr = 0 

n 3t 0 


( 11 . 1 ) 


to be solved for a function 


t = (t , , , . ,t ) -> v(t) , 
1 n 


i.e., a map R V. 

We can then associate with this system the n-complex variable "pencil" 


a(s) = a-s,+---+as 

11 n n 0 


( 11 . 2 ) 


of linear maps: V W, and the hoZomov'ph'ic vector bundle 


E = {(s,v):s=(s-,...,s)€CC; a(s)v=0} 

1 n 


(11.3) 


Now let G be a group. Suppose given three linear actions on (D^, V 
and W. 

Definition . G acts as a symmetry group of the pencil a(s) if the follow- 
ing condition is satisfied: 


got (s)g ^ = a (g(s) ). 

for all g 6 G 


(11.4) 
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Such an action determines an action of G on the vector bundle E; 

g(s,v) = (gs,gv) . (11.5) 

(If a(s)v = 0, note that using (11.4) 

a(g(s))(gv) = ga(s)(y) = 0 , 

so that the action (11.5) really does map E onto itself.) 

Thus we can form the orbit space G\ iC^ and ask whether the biuidle E 
comes from a b\indle on this orbit space. This is clearly an interesting system- 
theoretic way of defining vector bundles! 

12. SO (3,1) -BUNDLES 


If G has a known Lie-theoretic structure, we can use Lie group representa- 
tion theory to analyze the bundles constructed in Section 11 . One of the simplest 
cases (and the most important for physicists) is that where G is the three- 
dimensional rotation group. For algebraic reasons we complexify everything. 

We are supposing that n = 3. Refer to [30] for notation and 


ideas used here. 

Thus, 

r 

, 02^03 are linear maps: 

V ->■ W. If 

s = 


^12 

%3\ 


g = I 

^21 

^22 

^23 




^32 

933 / 



g(s) = gs = matrix multiplication 
It is convenient to write 

a = (a^a^a^) 


(a) <s) = (a^a^a^} 
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Suppose g is an orthogonal 3x3 complex matrix, i.e.. 


( ' denotes transpose of a matrix. ) Let 


G ^ L(V) 


G L(W) 


be the given representations of G by linear transformations on V and W. 


or 


02 (g) (aj^a2Ct3^ cr-]^ (g 


^ . 

^(g(s)) 


I^l\ 



^2 



\=3 / 


( = 3 / 


or 


a2(g)ttQa^(g 


-1 




( 12 . 1 ) 

( 12 . 2 ) 


Now, 


L(V,W) H V ® W 


J 

(W = dual space to W. In fact, all finite dimensional representations of G 
are self-dual.) The "Clebsch-Gordan" rules for decomposition of tensor product 
representations of SO (3) tell how many such independent a's there are. 
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The vrveduo'thte (finite dimensional) representations of G are parameter- 
ized by cin integer j ^ 0 ( = spin) . The vector space for the spin j- 

representation has dimension (2j + 1) . Thus, V and W can be split up into 
a direct sxm of vector spaces V- ,Wj in each of which G acts via a direct 
sum of spin j~representations. using "Clebsch-Gordan" we see that 

a(s)(V^) c + Wj ^j-1 (12.3) 

cXqCV J c Wj , j = 0,1,2,. . . 


Example . Consider the Helmholtz operator with input u, output y; 


sy+Xy-u = 0 


Set: 


= s^y , i = 1,2,3 

Thus, 

3 

/Z s .X . = - Xy + u 

itl ^ ^ 


or 



(12.4) 


V 


R 


5 


1 



r 


\ 


/ 


4 3 




w 




1 ‘ ^1 \ 

>‘2 1 
X 


S2Y - 

3 

1 '' 1 


1 ^3^ " ^3 

\u / 


\ E - u / 

i 


(12.5) 


Thus, we see that V is a direct sum of a spin one subspace S and two 
spin zero sxabspaces. W is the direct sum of an Sj and an Sq- Now, Clebsch- 
Gordan gives; 


S, <8> S, = ® S, 


= S, 


i.e., there are intertwining maps 


* ^0 


=0® h 


It is these that are obviously present in (12.4). 


13. THE COMPACTIPIED VECTOR BUNDLES DETERMINED RY 
THE HELMHOLTZ EQUATION 


Continue with the situation of Sections 9-12, the linear differential 
equations which are invariant under the action of SO (3, 1) on iC . These 
equations cover roost of those of interest in classical mathematical physics! 

For s G let 

E(s) = {v £ V: a(.s)(v) = 0} . (13.1) 

As s varies over (C^, E(s) defines a vector bundle whose basis is £C^. 

G = S0(3,(C) acts linearly on E. 

We want to examine the a Igebro- geometric structiore of this bundle, and any 
possible bundles with orbit spaces 
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homogenize" Equations (11.3) , (12.4) . Introduce 


G\ 


The first step is to " 
homogeneous coordinates 


with 




s . = 

1 


i = 1,2,3 


Equations (12.4) take the form 



These equations detevm-ine a veatoT hundle E whose base -is P 3 (C) , wh-lah 
restr-Lets to the gi-ven bioidle when is errhedded as the "affine" subspaoe 

of Psd). 

Let us work out the fibers of this bundle. If Tq 0, these equations 
are equivalent to (11.3), and the solutions E(s) obviously form a one- 
dimensional vector space. The "input-output" relations are 



This gives a "geometric", systems-theoretie meaning to l/(s^+A) as the 
"transfer- function- fundamental solution-Green' s function" for the Helmholtz 
equation ! 
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For 


T 


0 


0 , 


Equations (13.2) take the following form; 


T .X = 0 

.1 


E 


i=l 


T .X. 
1 X 


0 


(13.3) 


Since one of the is nonzero, (13.3) forces y = 0. Then, (13.3) reduces to 



(13.4) 


E(t) consists of the (y,u,xj^) such that onty (13.4) is satisfied. Thus, 


dim E(t) = 3 


The vector bundle E does not have constant dimension as t ranges over P 3 ((n). 
we see that there is here a distinct difference from the one-dimensional situation! 

However, we can construct an equivalent bundle E' on which does have 

a non-singular extension to p 3 (iC). Namely, 

E' 


Again, put 


(y,u,s) : y = 


s^ + A 


u 


s . 

1 


T . 

1 
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Thus 


E' (t) 



Thus, if = 0, 

E ' (t ) - { (x,u) ; X = 0} 

We see tiiat for aZZ x £ P 3 (ff) , 


dim E ' (t ) = 1 


We can also examine how E' projects down to the orbit space 50(3,0:) \ 0:^ . 

Map 

3 

(|) : T 0: 


as follows 


<j) (s) = s = z 

The fibers are the orbits of 50(3,0;) . We would now like to "projectify" this 
situation 
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(13.5) 



Note that this map sends one-dimensional linear siabspaces of into one- 
dimensional linear subspaces of hence it passes to the equivalent to 

define a map 




(13.6) 


The orbits of S0(3,Ct) lie in the. fibers of (J>. 

Thus <{> and the vector bundle E' are very compatible. 


E(t) = < (x,u) : X = 




+ Xt, 


Set: 


E"((z^,z^)) = 


{x, u) ; X = 




u! 


This formula defines a non-singuloj? vector ^bundle on Pi (fl;) 


E* = (t> ^(E") 


i.e., E' is the pull-baak of the bundle on Pi (I) . 


14. LINEAR FILTERS AND GROUPS 

So far, we have been considering the class of input-output sysrems defined 
in terms of differential equation models. Of course, "system theory" is much 
more extensive, and not necessarily tied to all such models. In this section 1 
will briefly describe another foundational approach. It too has a basic geometric 
aspect that has barely been developed. 

Consider a "system" as a mapping from certain input data to certain output 
data. It is important to specify some sort of mathematical structure on the 
input -output data and to consider systems which interrelate this structure in 
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some way. For example, let us analyze the linear, finite dimensional, time- 
invariant state space models from this point of view. Consider one, say of the 
form 


dx 

dt 


Ax + Bu 


y = Cx 


(14.1) 


x,u,y are vectors in finite dimensional real vector space, and A,B,C are 
matrices of the appropriate size, t is a continuous time parameter. Let 
u: t u(t) be an input function. We can then solve (14.1) W'lth zevo 'In'it'Ldt 
conditions to define the output t ->■ y(t) ; 

t 

y(t) = j Ce^^^ Bu(s) ds . (14.2) 

“ □ 

system as a mapping u y which is linear and of the 


K(t,s) u(s) ds , (14.3) 


We can then consider the 
form: 

00 

y(t) = / 

~oo 


where (t,s) ->■ K(t,s) is a mapping R x R (matrices) such that the following 
conditions are satisfied: 

a) K(t,s) =0 if s < 0 (14.4) 

b) K(t,s) = 0., if s > t^ ,, (14.5) 

c) K(t + f',s) = K(t,s-t'), t,t',s £ R (14.6) 

Condition (14.4) is just a normalization of the output at t = 0, and is not 
particularly important. Condition (b) is crucial, since it expresses oausatity, 
if the incoming signal u is zero for time t< 0, it guarantees that the output 
y is also zero for t<5. Condition (14.6), which expresses translation invari- 
ance, is a group property , and suggests ,that group theory might be the appropriate 
general setting for some of these ideasc 

For example, one might replace the role of the real niimbers R, i.e., set t 
by the integers, Z. An input is then a map u: n u(n) of Z (input vector 
space U ) , an output is a map y: Z (output vector space Y ) , and the "system" 
is a map 


u -»■ y 

of the form 


00 

y(n) = ^ K(n,m) u(m) 

■ in=-oo 


(14.7) 
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We can immediately write down the analog of conditions (14,4) -(,14 .6) . Such 
input -output relations might be generated by the difference equation analog of 
the differential equations (14.1) : 


n+1 


X + Ax + Bu 
n n n 


y 


n+1 


Cx 


n+1 


(14.8) 


However, we can generalize relation (.14.7) to other groups. For example, replace 
the integers Z with the direct product group Zx Z. A system then might be a 
map (input) (output) of the form: 


y(n^,n2) 


^ K( (n^,n^) , (m^,m2)u(m^,m2) ) 
(m^,m2) 


(14.9) 


Relation (14.6) generalizes readily to this case. However, the right generaliza- 
tion of (14.5) — causality — is less existent. In fact, there is now a large body 
of current engineering literature- lore concerning such systems, called, say, 

2-D dig'ltaZ fi-Zteps , and the lack of a natural generalization of "causality" is 
one complicating feature that has slowed down the development of a systematic 
theory paralleling the non-traditional 1-D theory. Of course, these non- 
traditional hybrid system-filter problems are extremely important in the state 
of today's technology, particularly in terms of realization of the ultimate 
potential of digital computers. Since I believe that geometric-group theoretic 
ideas are potentially important here also, I will now sketch a general setting 
for this approach. I will also indicate how some of the stochastic filter- 
system problems can be considered in the same way. 

Suppose given the following data: 

a) A real vector space U called the input space, 

b) A real vector space Y called the output space, 

c) A space T called the parameter\time-space . 

U and Y may be infinite dimensional and T may be continuous or discrete. 
However, the functional analysis complications are reduced if we work with the 
case where T is discrete 

d) A a-field of subsets of T, and a measure dT on this sigma-field. 

(In this section it will be assumed that the reader is familiar with the rudiments 
of measure theory and functional analysis, say at the level of Ref. [33].) 

Let t^/(T, U), denote the space of mappings with domain t and 

range U and Y. A tineop system, in this framework, now might be defined as a 
triple ( 0,C/O with the following conditions satisfied: 

a) <= ,y^(T,U) 

b) 0 <=. ^(T,Y) 

c) ^ and 0 are linear subspaces of the vector spaces in which they are 
embedded, by (a) and (b) 
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d) CfC is a linear map: ^ , of the following form; 


J^r(u) (T) 


/ 


k(t,t')u(t') dT ' 


where is a mapping of 


(14.10) 


T X T (space of linear maps U->Y) 

(Of course, one must also postulate sufficient functional analysis detail in 
order that the terms in (14.10) even make sense. In certain situations it is 
desirable to relax the framwork to allow the "kernel” K to be a distribution- 
generalized function in the sense of L. Schwartz and F. Gelfand.) 

In order to make contact between this general framework and contemporary 
engineering (and I want to emphasize how widely these ideas are dispersed 
throughout mathematics, statistics, and the signal-information-corrmuanication- 
filtering- prediction parts of technology) , it is of course necessary to special- 
ize further. There are two aspects I want to discuss briefly here because of 
their considerable geometric significance: 

a) A group G acts on T as a transformation group, and certain 
relations of compatibility are presented between G and the 
system ( ,9 , 0 ,Cf{) . (The prototype is that where T = real 
numbers or integers, G = additive group, and the "compatibility" 
relations are those which describe "stationarity" of the processes.) 

The group action is denoted as 

(g, t) gx 


b) G acts on the products T x u, T x y, in the following way: 


g(T ,u) = (gT,a(T,g) (u)) 

g(T,y) = (gT,B(T,g) (y)) 

for T e T, g e G, u € U, y € Y. 


(14.11) 


we require that for each (x,g) , ct(x,g) are linear maps on U 
and Y. 


(Note that the conditions that (14.12) define a transformation group action on 
Txu and Txy requires that certain conditions be satisfied by a and 3- 
We shall not write them down explicitly.) 


This action (14.11) then defines an action of G on 
as follows: 


(gu) (x) 

= a(x,g) (u(g ^x) ) 

(gy) (x) 

= 3(T,g) (y(g ^x) ) 


c^/'(T,U) and , 


(14.12) 
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One can now require that systems, defined as linear maps 




commute with the action of G on <y^(T,U) and ty//(T,Y) . This is a sort of 
generalized "stationarity" . Here are some examples that appear with great 
frequency in the engineering literature. 


Example . 2-D translation-invariant digital filters with soalar signals. 

Let U = Y = 1, the complex numbers, and let T — Z x Z, G = Z x z acting 
on itself. Let G act on i//5^(T,U} and via simple translation: 


(gu) (n^,n^) = u((n^,n 2 )-g) 


(gy) (n^,n^) = y((n^,n 2 )-g) 

for gGG — Z^Z, ( n^ , n^ ) ^ Z x z 


(14.13) 


Let K; T ^ t be a map and let 


JiT: a^(T,U) ^ c.//(T,Y) 

be the map defined for the following ..formula : 

J^T(u) (n^,n2> = ^ K(n^ - m^, n^-m^) (14.14) 

(m^,m2) € ZxZ 

This linear map is translation invariant, i.e., 

J4f(gu) = gj>ir(u) . (14.15) 

Let us now exploit the invariance under G. Decompose .^i^(T,u) into 

linear subspaces which transform irreducibly under G. This can be done in the 
following way: Let G* denote the set of all group-homomorphisms 

A: G ^ T 

Two such homomorphisms can be multiplied; they form a group called the dual group 
to G. In this simple case (i.e., G=Zxz), G* can be identified with 
Cxa:; For z = (z ,z) GfCxiC, 

nizi + n2Z2 

^z^^l'^^2^ = e . (14.16) 

For A = z e G*, let u £ be the map defined as follows: 

— z 
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—njZj — H2Z2 

u (n ,n ) = e (.14.17) 

— z 1 2 

for G T 

Thus, for g G G, X £ G*, 

g(u^) = X(.g)u^ (14.18) 

This equation means that u^ "transforms via the irreducible representation X". 

The "digital filter" ^ now commutes with G, hence maps u^ into a 
linear subspace of which transforms via X. Set 


-niZi-n2Z2 

= e 

Thus, for each X = z H (z^,Z 2 ) G (Cxfl: h G*, 

JT(u ) = T(z)u (14.19) 

-z -z 

O) 

with T(z) € it 
This f\inction 

z -*■ T(z) (14.20) 

is called the transfer function of the filter/system 

It can, of course, be computed in the more traditional way in terms of the 
kernel K. 


^ K(n^-m^, n^-m^) u^(m^,m2) 


( r ) 


“in 1 z “ ni 2 z 2 


= ^K(n^-m^, ^2 - m^) e 

- (m]^+n j^) Z]^ + (m2+n2) Z 2 

= ^ K(m^,m 2 ) e 


-™lZi-n2Z2 

= K(m^,m 2 ) e 

hence : 

-m 1 z 1 — m 2 Z 2 

T(z) = ^ K(mj^,m 2 ) e (14.21) 
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The right hand side of (14.21) is called (at least in the engineering literature) 
the z-tpansform of K. 

The mathematical problems that are most important for applications involve 
conditions for causality and stability in terms of T. 

Example 2 . Stationary stochastic process as inputs and outputs. 

This example is a traditional one in the engineering literature (and goes 
back to the original work of Kolmogoroff and Wiener) . First, let us review some 
probability theory. 

Let be a space with a fixed 0 -field of subsets and a probability measure 
d(i) defined on this family of subsets. A random variable is a measurable mapping 
f; ->■ R. The integral of f with respect to the probability measure dm is 

called the expectation of f, and denoted as follows: 

E(f) E / f dm (14.22) 

Q. 

Let H denote the vector space of measiurable functions f: -> R such that 

E(f^) E r f^ dm < “ (14.23) 


H then forms a Hilbert space — "the" L of dm. The inner product is 




The norm is 


I|f!l E <f,f> 


1/2 


Let T again denote a parameter space. (It will suffice to suppose T 
discrete. This avoids certain analytic complications of the theory of stochastic 
processes.) For our purposes, it suffices to restrict attention to random 
variables that are square-integrable, i.e., lie in H. A stochastic process 
(parameterized by T ) is a mapping 

S: T H 


Suppose such a process S is given, and in addition, G is a transformation 
group on T. Let 0(H) be the group of isomorphisms of the Hilbert space H, 
i.e., the space of linear maps A: H ->■ H such that: 
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exists 


The inverse map A ^ 

<Ah^,Ah2> = <h^,h2> 

i.e., A is isometric 


Definition . The stochastic process S: T -»■ H is stat'ioruu'y (relative to 
group G which acts on T) if there is a group-homomorphism a: G ->■ 0(H) such 
that 

a(g)(S(T)) = S(g"^T) (14.24) 

for g € G, t 6 T 

Remark . Condition (14.24) means that the "means" and "correlations" 

T E(S(t) ) 

(Ti,T2) -> E(S(t^)S(t2) 

are inoccptant under translat-ton by G. in the case where T = R or z, G = 
additive group of translations, this is what is usually called "weak stationarity" 
in the theory of stochastic processes. 

Having defined "stationary processes" in this way, we may now define fitters 
as maps whose domain and range are stationary stochastic processes. For example, 
if T = G = Z, it is readily seen that such tinear filters are of the form 

f k*f, 

where k: T -»■ R is a real -valued function and * is the usual coKOOtution. 

Again, as in Example 1, this leads us back to "harmonic analysis" of the group 

G. 


In summary, in this section we have briefly seen that there are possibili- 
ties for application of general Lie group-geometric methods in the theory of 
deterministic and stochastic filters. 


15 o CONCLUDING REMARKS 


Certain general principles for the application of modern differential 
geometry to system theory have been presented. These principles have been 
applied (in joint work with Clyde Martin) to the feedback structure of linear 
time- invariant systems. The key technique is the theory of hotomorphic vector 
bundtes . Such bundles can also be defined for distributed parameter systems, 
although they now have singularities and their "poles" become submanifolds. 
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This approach constructs a bridge between the classical work in electrical 
engineering and control theory concerned with the description of systems via 
^'transfer functions” and modern differential and algebraic geometry. Where the 
engineer speaks of "poles", the geometer considers "intersections". One might 
expect that mathematics of this sort will become essential for a systematic 
treatment of the "non-classical" control problems that are on the technological 
horizon . 
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APPENDIX 

GENERALIZATION OF THE PSEUDO- INVERSE CONSTRUCTION TO 
RIEMANNIAN MANIFOLDS; SLICES AND CANONICAL 
FORMS FOR GROUP ACTIONS 


One of the fundamental mathematical problems of system theory is to find 
liftings of mappings in the following sense; 

Let X,Y,A be spaces and let 

it: X Y 
(j> : A Y 

be mappings. We want to find mappings 
4>' : A X , 

such that 


^t|)'(A) = (f)(A) 

for all A £ A 


Further, one wants to let (j) ' depend on <|) in a reasonably "robust" way. 
For example, the estimation problem takes this form: 

X = a space of systems 
Y = a space of input-output relations 

IT : X ->• Y the map which assigns to each system the input- 
output relations it determines 


In case X is the space of controllable and observable linear systems with 
given input, state, and output spaces, Y is the orbit space of the action of 
the group of change of bases on state space. 

In case X and Y are finite dimensional vector spaces, the construction 
of (J)' is usually handled via what is called (in statistics and numerical 
analysis) the pseudo-invepse . We shall first recall its definition, then sketch 
a possible generalization to Riemannian manifolds. Then we go on to the case 
(which has evident implications for system theory) where Y is the orbit space 
of the action of a group G on X. The simplest general theory occTors when X 
has a Riemannian metric, and G is a group of -isometpies of this Riemannian 
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metric. In case G is di,sarete, Poincari suggested (in classical work) a 
method of constructing a "fundamental domain", i.e., a slice of the orbit space. 
In [31], the author has suggested a method of generalization of Poincare's 
construction. This construction will be reviewed in this appendix. 

The Pseudo- Inverse 

Suppose that X and Y are finite dimensional real vector spaces with a 
given positive-definite symmetric dot product: 

(Xi,x 2 ) x^ • x^ e R 

(Yi,Y 2) Y^ • Y2 e R 

The norms of x and y are then 

lUll = (x-x)^/" 

IIyII = iy ' 
d(Xi,X2) = llx^-X2ll 
d(Yi,Y2) = llyi-Ysll 

are then metr-Los on X and Y. 

Suppose that ir: X -> Y is a linear map. Let tt*: Y X be the dual map, 
defined by the following relation 

y • TTX = 'ir*y • x 
for X e X, y e Y . 

Let us now minimize the function x ->• |lxl|^, subject to the oonstralnt 
y - TTX = 0. Using the Lagrange multiplier rule, introduce the Lagrange multi- 
plier vector X £ Y and define the function 

f(x,y,X) = l|x||^ + X ■ (y-TTx) 

We now look for the unconstrained critical points of the function x f(x,y,X) 
with (y,X) held fixed: 


58 



dt 


f (x + t6x,y,X) 


t=0 


= 2x • 5x - 2X • TT (6x) 


= 2(x-ir*(A)) • 6x 


X is a critical point of this function if and only if 


X = Tr*(X) 


(A.l) 


The constraint condition gives: 


y = •rrTT*(A) . (A-2) 

Definition . The pseudo-i-nveTses of y under the map it are the points 
X e X of the form (A.l), where A £ Y satisfies (A. 2). 

Remark . Notice that conditions (A.l) and (A. 2) together’ imply that ttx = y. 
There is a pseudo-inverse if and only if irir* is an isomorphism Y ->■ Y. In 
this case, 


y ^ TT* (mr*) ^ (y) = y (y) 

is a map Y -> X which is a cross-section of the map tt: X Y, i.e. , iry = 
identity . 


Maps Between Riemannian Manifolds 

Let X and Y be Riemannian manifolds. Denote the inner product 
tangent vectors as 

(For notation or background of Riemannian geometry, see [22].) If t ^ x(t) , 
a < t < b, is a curve in X, then 



is the length of the curve. The distance d(xi,X 2 > between two points 
xj,X 2 € X is the greatest lower bound of the length of all continuous, precise 
C“ curves joining Xj^ to X 2 - This satisfies all the usual axioms needed to 
define a metric space. 
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Suppose that tt; X Y is a map. Pick a point xq € X which is held 
fixed throughout the discussion. If 2 is a subset of X, let 


d(XQ,Z) = inf d(xQ,z) 
z € Z 


For y € Y, set 


Y(y) 


{x6 X: 


X e ir ^(y) , 


and 


d(x^,x) 


d(x^ 


IT ^(y) ) } 


The points in y(y) aire called the pseudo-'invevs&s of x under the map ir 
(relative to the fixed Riemannian metric) . 

We are especially interested in three questions: 

a) When do there exist points in y(y)? 

b) When does there exist a unique point in y(y)? 

c) If there is a unique point in y(y) each y £ Y, is the map 

y y(y) °f y X (which is a cross-section for the map ir: X ->- Y) 
sufficiently smooth to provide the sort of "robust" liftings of mappings 
A -> Y needed for applications? 

Now, a reasonable sufficient condition for (a) is the following one: 

a') The subset ir~^y) is a closed, regularly e m bedded submanifold of X 
and the Riemannian metric on X is complete. 

In case (a'} is satisfied, and xq lies sufficiently near ir~^(x) (e.g., in 

a tubular neighborhood) , then (b) will be satisfied. However, there is no 
known reasonable condition which will imply (b) . (c) represents even more 

unknown territory. However, the powerful techniques available for handling 
questions like these ("global Riemannian geometry") give some hope that this is 
a construction which might prove useful in certain circiamstances. Certainly, 
it is the natural generalization of the "pseudo- inverse" , which has proved to 
be widely useful and deserves further study. To my knowledge, there is no work 
in the literature in this direction. 


SLICES, FUNDAMENTAL DOMAINS, CANONICAL FORMS, 

ETC. FOR TRANSFORMATION GROUP ACTIONS 

Let X be a manifold, G a Lie group, and Gx X ^ X; (g,x) -> gx, be a 
transformation group action on X. The ovhi-ts of G are the subsets of X of 
the form Gx, with x £ X. An orbit space for G acting on X is another 
space Y (possibly having a more general structure than a manifold) and a map 
it: X -> Y such that the fibers Tr“^(y), for y £ Y, are the orbits of G- 
Suppose such an orbit space is given. 
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A subset sex is a st-iae (or fundamental domatn) for the action of G 
if S meets each orbit of G in precisely one point, i.e., if the following 
condition is satisfied; 

TT restricted to S is one-one and onto. 

Thus, if Y = ( TT restricted to S)“^, y is a cross-section map: Y ->■ X. 

(Conversely, if y: Y X is a cross-section map for it, then S = y(Y) is 
a slice.) 

One usually wants S to have further properties. For example, it would 
be nice if it were a submanifold. (This is usually too much to hope for.) 

The concept of "canonical form" is rather vague and indeterminate in the 
literature. Sometimes it means a slice, sometimes a more general subset which 
touches the "generic" orbit in precisely one point, but may have a larger inter- 
section with certain "singular" orbits. In system theory (where it is the key 
concept in attempts to solve the "identification problem") it is often a sub- 
mantfold of X, which touches a "generic" set of orbits precisely once, but 
does not necessarily touch all orbits. 

In one case, generalizing the classical construction due to Poincare for 
discrete groups [31], there is an elegant and useful definition of a "slice". 
Namely, suppose that X is a complete connected Riemannian manifold, and that 
G is a group of tsometv'Les of X which is a closed siibgroup of the Lie group 
of all isometries of X. It follows [22] that all orbits of G are regularly 
embedded, closed submcinifolds of X, and that each has a tubular neighborhood 
which is invariant under G. Pick a point xg £ X. Set: 

P = {x£X: d(x,x^) ^d(x^,Gx)} 


The proof that P is a slice, and has other nice properties, is given in [22] , 
which also contains further useful general information about the orbits and the 


structure of the orbit space in this case. Finally, notice that P is precisely 
the "generalized inverse" set, as defined above, where Y = orbit space G\X, 
and IT : X Y is the natural quotient map which assigns the orbit Gx to each 
X £ X. 
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